
BA C K GROUND QF THg INVENTION 

1. Field of the Invention 

5 The invention relates to signal processing systems 

and methods and, particularly, to techniques for normalizing 
signal levels in a signal processing system. 

2 . Description of the Prior Art 

10 

Many electronic systems configure and format 
signals by linking together a series of signal segments 
obtained from a number of different sources. In such 
systems, it is usually important that the signal levels at 
15 the different sources be matched. Conventional television 

and radio transmission systems are notable examples. 

For example, subscriber television systems, such 
as cable television (CTV) , normally deliver programs that 

20 are formed from a number of successive segments that 

originate at different sources. Many cable channel 
programmers set aside approximately four minutes in two 
blocks each hour for local advertising insertion. These 
advertisement blocks are sold by the local cable operator or 

25 by an advertising consortium of several cable systems. The 

cable operator automatically inserts the advertising in, for 





example, a satellite -delivered program coming from the 
programmer. The insertion is usually done locally under the 
control of cue tones transmitted by the programmer. At 
these specific cues, the cable operator switches different 
5 audio and/or video programming. Consequently, cable 

operators frequently encounter the problem of matching the 
audio and/or video levels between the different sources. 
This problem is particularly acute in CTV systems where the 
system performs automatic switching with no human operator 
10 to adjust levels. 

In many prior art CTV systems, ad- insertion is 
handled by a combination of cue tone detectors, switching 
equipment and tape players which hold the advertising 

15 material. Upon receipt of the cue tones, a CTV insertion 
controller automatically turns on a tape player containing 
the advertisement. Switching equipment then switches the 
system output from the video and audio signals received from 
the programming source to the output of the tape player. 

20 The tape player remains on for the duration of the 

advertising, after which the insertion controller causes the 
switching equipment to switch back to the video and audio 
channels of the programming source. When switched, these 
successive program and advertising segments usually feed to 

25 a radio- frequency (RF) modulator for delivery to the 
subscribers . 



30 



Many subscriber television systems, such as CTV 
systems, are currently being converted to digital equipment. 
In the future, video file-server systems will replace many 




of the conventional tape players. These new digital systems 
compress the advertising data, e.g., using Motion Picture 
Experts Group 2 (MPEG2) compression, store the compressed 
data as a digital file on a large disk drive (or several 
5 drives) , and then, upon receipt of the cue tone, spool 

("play") the file off the drive to a decompressor. The 
video and accompanying audio data are decompressed back to 
standard video and audio, and switched into the video/audio 
feed of the RF modulator for delivery to the subscribers. 

10 

One of the most critical problems confronting 
designers of CTV systems and other similar transmission 
systems, has been normalizing the audio and video levels 
between the programming and the advertising. It is 
15 generally known that many subscribers have complained for 

years that the audio sounds higher during commercials than 
during programming. Although the audio during commercials 
can also sound low compared to the program level, few people 
complain in that case. 

20 

Consequently, those concerned with the development 
of radio, television, data, control and equivalent 
transmission systems have recognized the need for more 
effective signal-level normalization techniques. The 
25 present invention fulfills this need. 

SUMMARY QF TBS INVENTION 

Therefore, it is an object of the invention to 
30 provide an improved signal -level normalization technique for 




use in signal transmissions such as audio, video and data 
transmissions . 

It is another object of the invention to provide 
signal -level normalization technique particularly suitable 
for use in digital networks. 



A further object of the invention is the provision 
of systems and methods of normalizing the signal level in a 
first signal block, such as the audio and/or video levels in 
a television program or a commercial, based on comparable 
5 measured levels of a second signal block, such as a prior 
program or a preceding commercial . 

Still another object of the invention is the 
provision of systems and methods of normalizing the signal 
10 level in a signal block based on previously measured levels 

of the signal block. 

Yet a further object of the present invention is 
the provision of a signal insertion technique capable of 
15 performing quality checks, such as verifying that the proper 

commercial was inserted into a predetermined location in the 
advertising block of a program. 

According to the invention, a signal processing 
20 system having a normalized output signal comprises a first 

signal source and a second signal source. A signal combiner 
connects to the first and second signal sources for forming 
an output signal by linking signal segments derived from the 
first and second signal sources into a series of the signal 
25 segments. A level processor connects to the signal combiner 

for determining a level of intensity of the output signal. 
A level adjuster connects to at least one of the signal 
sources and responds to the level processor for adjusting a 
level of intensity of the signal segments* at least one of 




the signal sources such that the level of intensity of the 
output signal is normalized. 

More specifically, the invention provides a signal 
5 transmission system for producing a composite output signal 

formed by linking signals from a plurality of signal sources 
comprising a first signal source for generating a series of 
first segments and cue tones indicating the borders of the 
first signal segments, and a second signal source connected 

10 to the first signal source. Responsive to the cue tones, 

the second signal source generates a series of second signal 
segments. The first and second signal segments include 
audio signals. A signal combiner is connected to the first 
and second signal sources for forming the composite output 

15 signal by alternately linking the first signal segments with 

the second signal segments. A level processor connects to 
the signal combiner for determining the loudness of the 
audio portion of the composite output signal. A level 
adjuster connects to at least one of the signal sources and 

20 responds to the level processor for adjusting the loudness 

of signal segments from the signal sources such that the 
volume of the output signal is normalized. 

Still another aspect of the invention involves a 
25 signal transmission method for transmitting an output signal 

by generating composite output signals in each of a 
plurality of signal channels. The composite output signals 
in each channel is formed by generating a series of first 
signal segments and cue tones indicating the borders of the 
30 first signal segments, and generating a series of second 
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signal segments in response to the cue tones. Each channel 
output is formed by alternately linking the first signal 
segments with the second signal segments at the borders in 
response to the cues tones. The channel outputs from each 
5 channel are combined for simultaneous transmission to users. 

The method includes the step of determining a level of 
intensity of signals in a first channel output and adjusting 
a level of intensity of each of the channel outputs as a 
function of the intensity in the first channel output such 
10 that the levels of intensity of the channel outputs are 

normalized . 

These and other objects, features and aspects of 
the invention will be more clearly understood and better 
15 described if the following detailed description is read in 

conjunction with the appended drawings wherein: 



BRIEF DESCRIPTION OF THE DRAWINGS 

FIG\^1 is a system block diagram of a cable 
television head-end system constructed in accordance with a 
preferred embodiment of the invention. 



20 
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FIG>2 is a graph of loudness of an audio signal 
25 on a logarithmic scale vs. time, which is useful in 

understanding the preferred embodiment of FIG. 1. 

FIG. 3 is a detailed block diagram of a cable 
television head-end system constructed in accordance with 
30 the preferred embodiment of FIG. 1. 
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FIG> N 4^is a graph, similar to that of FIG. 2, 
showing loudness of an audio signal on a logarithmic scale 
vs. time, which is useful in understanding the preferred 
embodiment of FIGs . 1 and 3 . 



FIGn Ss 5A is a flow chart illustrating process steps 
performed by the preferred embodiment of FIGs. 1 and 3* 



FIG.,. 5B is a flow chart, similar to that of 

\ 

10 FIG. 5A, illustrating alternate process steps performed by 

the preferred embodiment of FIGs. 1 and 3. 



FIG. 6 is a detailed block diagram, similar to 
that of FIG. 3^ of an alternate embodiment of the invention, 



FIG. 7 is a graph, similar to that of FIG. 4, 
showing loudness of an audio signal on a logarithmic scale 
vs. time, which ^is useful m understanding the embodiment of 
FIG. 6. 



FIG. 8 is a flow chart illustrating process steps 

\ 

performed by the embodiment of FIG. 6. 



FIGk 9 is a system block diagram, similar to that 
\ 

25 of FIG. 1, of a x cable television system constructed m 

accordance with another alternate embodiment of the 
invention . 
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FIG. 10 is a detailed block diagram, similar to 
that of FIG.^6v^of the alternate embodiment of the invention 
shown in FIG. 9. 



FIG. 11 is a detailed block diagram of a 
commercial verification system in accordance with the 
present invention . 



FIG. 12 is a flow chart illustrating process steps 

x 

10 performed by dhe system of FIG. 11. 

FIG. 13 is a detailed circuit schematic with 
parameter values for the various elements thereof, 



representing a ^particular implementation of a level detector 

\ 

15 that forms a part of the various preferred embodiments of 

the present invention. 



FIG. 14 is a graph showing a plot of response in 

\ 

decibels (dB) v^. frequency for comparing test results for 
20 the circuit of FIG. 13 with a conventional A-weighting curve 

which closely resembles the frequency response of a typical 
human ear. 



DETAILED DESCRIPTION QF THE PREFERRED EMBODIMENTS 

25 

Referring now to the drawings, FIG. 1 shows a 
general block diagram of CTV system 20. It is to be 
understood that the particular CTV systems described herein 
are exemplary and that the invention has application in 
30 other equivalent audio, video, data and other equivalent 
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transmission systems. CTV system 20 includes earth station 
receiver 21 which provides decoded program material, both 
video and audio, at its output lines 22. Earth station 
receiver 21 comprises conventional receiver and decoder 
5 equipment. A typical receiver 21 normally outputs analog 

video (e.g., in the National Television System Committee 
(NTSC) format), plus one to three channels of audio. The 
audio channels are most commonly left- and right -channel 
stereo and possibly a third monaural channel for a second 
10 audio program (SAP), often in another language. Many cable 

operators contemplate replacing these analog systems with 
more modern digital CTV systems that output digital video 
and/or audio signals from receiver 21. 

15 In addition to the video and audio channels, 

receiver 21 transmits cue tones on line 23 to ad- insertion 
system 24 which uses the cue tones to generate timing 
signals for controlling the insertion of advertising into 
the program material. In most systems, the cue tones are a 

20 series of dual-tone multiple- frequency (DTMF) tones which 

identify the programmer and the insertion times in the 
ad- insertion process. Typically, different cues are 
transmitted for pre -roll (an advance in time to allow a tape 
machine to get up to speed) ; transf er-to-ad (a time at the 

25 beginning of the advertisement block) and return (the 
conclusion of the advertisement block) . 

Timing signals, which ad-insertion system 24 
outputs onto lines 34, operate transmission switch 25. 
30 Ad- insertion system 24 generates these timing signals in 
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response to the cue tones received on line 23. When 
operated, switch 25 routes either the program audio and 
video, which appear on lines 22, or the advertising video 
and audio, which appear on lines 27, to modulator 3 0 via 
5 lines 33. Modulator 30 modulates the video and audio onto 

an RF carrier that it transmits to subscribers along with 
all other channels via combiner 28. Modulator 30 will often 
include other equipment that receives and processes the 
signals, such as scramblers and stereo encoders or the like. 

10 

Consequently, in response to appropriate cue tones 
on line 23, ad- insertion system 24 automatically inserts 
commercials into advertising blocks by spooling advertising 
material onto lines 27 and, hence, modulator 3 0 via 

15 switch 25 and lines 33. To provide normalization between 

the audio and/or video levels of the commercials with that 
of the program, level processor circuit 2 6 monitors the 
signals being switched onto lines 33. Level processor 
circuit 26 adjusts, via lines 31, the appropriate signal 

20 levels of the advertising material being outputted by 

ad-insertion system 24. 

As described above, a primary problem addressed by 
this invention involves correcting a possible mismatch 
25 between the audio level coming from receiver 21 and that 

coming from ad- insertion system 24. Each apparatus 
represents a source of audio that is prepared at a different 
location by different people using different equipment. 
While efforts have been made in the past to normalize the 




audio levels from these different sources, limited success 
has been achieved. 

Typically, a level adjustment device resides 
5 within ad- insertion system 24 to allow operating personnel 

to manually or electronically correct for gain variations in 
the equipment. These level adjustment devices are often a 
source of possible errors due to frequent miss -adjustments . 
Still further, the gain of various pieces of system 
10 equipment will often shift with time and temperature. 

Yet another problem in the CTV industry is that 
the level coming from receiver 21 may be inconsistent from 
one channel to another. While most receivers and decoders 

15 used in the industry have fixed output levels for a fixed 
input at the uplink, different programmers operate uplinks 
with differing input levels. This is partially because 
different types of program material require different 
amounts of headroom, and different uplink engineers have 

20 different philosophies about the amount of headroom to 

allow. Because of this, level adjustments in ad- insertion 
system 24 have become necessary. 

FIG. 2, which plots loudness vs. time, illustrates 
25 the problem graphically. This graph shows the relative 

audio levels of four consecutive segments consisting of two 
program segments PI and P2, and an advertising block with 
two commercials CI and C2 located therebetween. Program 
segments PI and P2 and commercials CI and C2 have their 
30 audio levels varying over time. The first program 
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segment PI extends between times tl and t3 . An advertising 
block, starting with commercial CI, follows program 
segment PI. The graph shows commercial CI having a higher 
loudness than did the preceding program segment PI. Next in 
the advertising block is second commercial C2 . Because it 
was recorded differently than was first commercial CI, the 
graph shows it playing louder than commercial CI. The 
successive commercials CI and C2 extend between times t3 and 
t4. Finally, second program segment P2 follows second 
commercial C2 , and plays at the lower loudness level of 
program segment PI . 

The variations in loudness illustrated in FIG. 2 
usually produce the irritating phenomenon sometimes called 
"blasting 11 out the commercial. As mentioned above, 
broadcasters and cable operators have been accused of doing 
this intentionally to gain listener attention for the 
commercials. However, usually the source of the problem is 
more mundane: either the commercial automatically plays and 
no one is available to correct the audio level, or metering 
problems make an objective comparison of the levels 
difficult . 

FIG. 2 also shows the transmission times of the 
cue tones. The pre-roll cue tone arrives at time t2, which 
is usually about five to eight seconds before the commercial 
starts at time t3, to allow tape equipment to get up to 
speed before being played at time t3 . A transfer cue tone 
arrives at time t3 to insert first and second commercials CI 
and C2 into the advertising block. Finally, a return cue 



tone arrives at the end of the commercial break, time t4, to 
instruct the system to return to the program material 
supplied from receiver 21. 

5 Referring now to FIGs. 1-4 , a normalization 

technique in which the advertisement audio level is adjusted 
to match the level of the program audio will be described in 
detail. FIG. 3 depicts receiver 21 as having two channels, 
namely program video source 21V and program audio 
10 source 21A. Modulator 30 includes video modulator 3 0V and 

stereo encoder modulator 30A. FIG, 3 shows ad- insertion 
system 24 of FIG. 1 as having audio and video channels 
designated as advertisement video source 24V and 
advertisement audio source 24A. 

15 

FIG. 3 schematically depicts switch 25 as 
comprising a plurality of ganged switches, namely video 
switch 51 and audio switches 52, 53 and 54, being suitably 
switched by signals on lines 34. One input side of video 
2 0 switch 51 connects to program video source 21V. The other 

input side of video switch 51 connects to advertisement 
video source 24V. The output side of video switch 51 
connects to video modulator 30V. 

25 One input side of each of audio switches 52-54 

connects to respective left, right and SAP channels of 
program audio source 21A. The other input sides of audio 
switches 52-54 connect to the respective left, right and SAP 
channels of advertisement audio source 24A via respective 

30 attenuators 55, 56 and 57 which function as audio level 
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controls. The output sides of audio switches 52-54 connect 
to stereo encoder modulator 30A. 



The output sides of audio switches 52 and 53 also 
5 connect to adder 39 of level processor circuit 26 via 

lines 33. Because the preferred manner of evaluating the 
level of a stereo signal is to evaluate the sum of its left 
and right channels, adder 39 sums these stereo signals. 
Level detector 40 receives the summed output of adder 3 9 and 

10 inputs the inverting (-) side of comparison amplifier 32. 

Level detector 40 receives the audio signals at its input, 
and outputs a voltage corresponding to the loudness. The 
setting of switch 25 will determine which audio signals 
adder 39 sums. With respect to the graph of FIG. 2, the 

15 output of level detector 40 will be a voltage that varies, 

preferably as a decibel (dB) function, in accordance with 
the four curves representing the loudness of program 
segments PI and P2 and commercials CI and C2 . 

20 Audio reference level circuit 41, represented by a 

Zener-diode voltage source, provides a fixed reference 
level LI at the (+) input to comparison amplifier 32. Thus, 
comparison amplifier 32 outputs the difference between the 
actual volume level of the audio source, as detected by 

25 level detector 40, and the reference volume level LI which 

has been preset in reference level circuit 41. 
Analog- to-digital (A/D) converter 44 samples the output of 
comparison amplifier 32. Consequently, A/D converter 44 
outputs these sampled differences to logic 42 as a series of 

30 digital words proportional to a dB audio level. Logic 42 




communicates with a memory 43. As will be described below 
in detail, logic 42, which monitors the output of A/D 
converter 44, adjusts attenuators 55-57 via attenuator 
control lines 31 in accordance with error values calculated 
5 by logic 42. To perform its level adjusting functions, 

logic 42 derives control and timing signals from insertion 
controller 46, which is a conventional part of ad- insertion 
system 24 . 



10 In response to the cue tones transmitted from 

program video source 21V on line 23, insertion controller 46 
operates transmission switch 25 and sends control signals to 
logic 42, video source 24V and audio source 24A. Further, 
insertion controller 46 secures appropriate synchronization 

15 (sync) pulses and timing signals from sync separator and 

timing circuit 47, which is also a part of conventional 
ad-insertion system 24. Advertisement video source 24V uses 
the output of circuit 47 to synchronize its video with 
program video source 21V. Additionally, insertion 

2 0 controller 4 6 acquires appropriate timing signals from 

timing circuit 47 so that it can initiate spooling of the 
commercial when the appropriate pre -roll cue tone on line 23 
is received, for example, at time t2 of FIG. 2. Still 
further, insertion controller 46 operates transmission 

25 switch 25, via switch control lines 34, in accordance with 

the appropriate cue tones transmitted on line 23. 



It is noted that the SAP channel, if supplied, is 
controlled separately, based on its own characteristics. 
30 However, in the interest of clarity, this description and 




the related drawings omit the details of the SAP controls. 
However, it will be readily understood by those skilled in 
these arts that the SAP channel, and any other audio or data 
channels, may be treated in a similar manner as are the left 
5 and/or right stereo channels described herein. 

The operation of CTV system 2 0 will now be 
described with reference to the circuit of FIG. 3, the 
sample curves depicted in FIG. 4 and the FIG. 5A flow chart. 

10 As seen in FIG. 4, the audio level over any length of time 

is not constant; typical program audio may range from a 
whisper to the loud screech of an airplane taking off. 
Thus, it is necessary to be precise as to what is meant by 
the "level" of a program. For the present description, one 

15 preferred strategy is to match the last few seconds of the 

program audio level (right end of program segment Pi) with 
the first few seconds of the following commercial (left end 
of commercial CI) . Logic 42 performs this function by 
keeping a running log in memory 43 of "level" measurements 

20 based on an average of the last portion of the program 

material. At the time the program plays, this average 
becomes the "target level" to which the commercial audio 
level is matched. 

25 More specifically, upon start up of CTV system 20, 

logic 42 proceeds to monitor the output of A/D converter 44 
in monitor STEP 70 (see FIG. 5A) and continues to do so 
until the system is turned off. During the playing of 
program segment PI, transmission switch 25 is in its 

30 normally up position shown in FIG. 3. As such, adder 39 




will sum the left and right channels of program audio 
source 21A for program segment PI. Consequently, the output 
of comparison amplifier 32 will vary with the dB difference 
between program segment PI and fixed reference level LI. 

5 

Upon receiving the pre-roll cue tone at time t2, 
insertion controller 46 transmits a pre-roll trigger to 
logic 42 in trigger STEP 71, thereby causing it to store in 
memory 43 a series of output values from A/D converter 44 as 

10 indicated by store STEP 71. Logic 42 next receives, in 

trigger STEP 73, a transfer trigger from controller 46 at 
time t3 . At this point, logic 42 uses the series of 
previously stored values, in calculate STEP 74, to calculate 
and store the average output of A/D converter 44 between 

15 times t2 and t3 . This average represents the average 

difference D between program segment PI and reference 
level LI for the period t2-t3 (see the FIG. 4 graph) . Using 
this average difference D and reference level Ll, in 
calculate and store STEP 75, logic 42 calculates a target 

20 level and stores that value in memory 43 . 

It is noted that many conventional ad- insertion 
equipment know within a minute or two when a commercial is 
to be inserted, i.e., when to expect to receive a pre-roll 

25 cue tone. Because of this, the program segments may be 
monitored and averaged over a time period other than 
period t2-t3, which usually lasts only five to eight 
seconds. For example, the average difference D may be 
determined over a one or two minute period just prior to the 

3 0 advertisement block. 




As discussed above, at time t2 insertion 
controller 46 triggers advertisement video source 24V and 
audio source 24A to initiate the advertising process, i.e., 
to turn on the tape player to get it up to speed or, in the 
5 case of a digital system, to retrieve digital data from the 

appropriate file servers. Further upon receiving the 
transfer cue tone at time t3, insertion controller 46 
switches transmission switch 25, via lines 34, to the down 
position as viewed in FIG. 3. At this point, the 
10 advertisement video and audio channels are switched onto 

output lines 33 of switch 25 for transmission to the 
subscribers. As such, adder 39 now monitors and adds the 
left and right channels of advertisement audio source 24A. 

15 In calculate STEP 76, logic 42 calculates audio 

level errors En based on the current value of the target 
level and the output sample at the start of a commercial . 
For example, immediately after insertion controller 46 
transmits transfer trigger, i.e., at time t3+, logic 42 

20 calculates error El as the difference between the loudness 
level at the start of commercial CI and the target level. 
Logic 42 next performs adjust STEP 77 by applying the 
current audio level error El, via lines 31, to adjust the 
preset attenuators 55 and 56 in a direction that will cause 

25 the audio level of commercial CI to move down to the target 

level . 

In practice, reference level Ll represents the 
expected level for all advertisements and is normally 
30 obtained by recording (compressing) all commercials at a 
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nominally identical audio level equal to level LI. FIG. 4 
shows the average difference D and the audio level error El 
to be substantially equal, which illustrates the typical 
situation where commercial CI was actually recorded at the 
5 nominal or expected level LI. On the other hand, FIG. 4 

also shows commercial C2 as having been recorded slightly 
higher than the nominal or expected level Ll . 

Thus, if the audio sequence of program segment PI 
10 just prior to commercial time transmits louder than the 

expected or nominal level Ll, the level of commercial CI 
will be increased by adjusting attenuators 55 and 56. On 
the other hand, if the audio level of program segment PI is 
lower than the expected level Ll, as depicted in FIG. 4, 
15 logic 42 adjusts attenuators 55 and 56 to reduce the level 

of commercial CI down to the target level. 

Ideally this would be the end of the process. 
However, as a practical matter, it is not reasonable to 

20 expect that all commercials will be recorded at the "right" 

level. The commercials will be recorded at different times 
by different individuals using different indicating 
instruments, using different audio processing systems, and 
applying different ideas of how to interpret the audio 

25 level. Thus, the level of all commercials will usually not 

be the same. This is illustrated in FIG. 4 by the 
difference between the levels of commercials CI and C2 . 

Logic 42 continues to monitor the output of 
30 controller 46, waiting for the next trigger signal. In the 
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absence of a return cue tone, insertion controller 46 
signals advertisement video source 24V and audio source 24A 
to start playing commercial C2 at the end of commercial CI. 
At this time, controller 46 transmits a next -commercial 
5 trigger to logic 42 which exits the NO path of decision 

STEP 78 and proceeds to trigger STEP 79. In response to 
receiving the next -commercial trigger, logic 42 increments 
index n, in index STEP 80, and the process returns to 
calculate STEP 76. In calculate STEP 76, logic 42 
10 calculates the new audio level error E2, which represents 

the small additional amount that attenuators 55 and 56 must 
be adjusted, in adjust STEP 77, to move the level of 
commercial C2 down to the target level . 

15 When controller 46 receives a return cue from 

program video source 21V at time t4, it switches 
transmission switch 25 back to the normally up position of 
FIG. 3, thereby disconnecting advertisement video 
source 24V, and audio source 24A and reconnecting program 

20 video source 21V and audio source 21A to modulator 30. 

Controller 46 also transmits return trigger to logic 42 . In 
response, logic 42 exits decision STEP 78 on the YES path 
and returns the process back to monitor STEP 70. 

25 FIG. 5B, which is similar to FIG. 5A, depicts a 

modified normalization process in which the audio level 
offset of each commercial is stored on its first playing so 
that upon subsequent playing, the volume correction of that 
commercial may be set to an approximately correct value 

30 without the need to play, monitor and measure the 





commercial's level. In performing the FIG. 5B process on 
the program- commercial sequence of FIG. 4, logic 42 performs 
STEPS 70-75 to obtain the target level. At this point, 
logic 42 determines, in decision STEP 69, if the commercial 
5 about to be played, e.g. commercial CI, has been previously 

played. To do this, logic 42 looks up table entries in 
memory 43 that have been previously stored, listing 
commercial identification (ID) data, corresponding audio 
level errors Ep and its related target level. Controller 46 

10 provides logic 42 with the appropriate ID data. If this is 

the first playing of commercial CI, the process exits 
STEP 69 along the NO path to calculate error El, in 
calculate STEP 76, and proceeds to adjust STEP 77 to adjust 
attenuators 55 and 56. Next, logic 42 stores, in store 

15 STEP 68, appropriate table entries in memory 43, namely, the 

ID data for commercial CI, the corresponding error Ep equal 
to the current error El and its related target level. If 
logic 42 receives a return trigger in decision STEP 78, it 
exits the YES path and returns to monitor STEP 70 via index 

20 STEP 86 which increments index n. However, if logic 42 

should receive a next -commercial trigger in trigger STEP 79, 
the process returns to decision STEP 69, via index STEP 80 
which increments index n, and the process repeats for 
commercial C2 . 

25 

However, if the commercial to be played has played 
previously, the process exits the YES path from decision 
STEP 69. Logic 42 uses the commercial's ID data to accesses 
the appropriate table entry in memory 43. In calculate 
30 STEP 63, logic 42 calculates error Er using the current 




target level, error Ep, and its related target level. 
Logic 42, in adjust STEP 64, next adjusts attenuators 55 and 
56 using audio level error Er. The process then moves to 
decision STEP 78 and proceeds further in the manner 
5 described above . 



Pre-correcting the loudness setting for the 
commercials, as just described with respect to FIG . 5B, 
requires that tables be constructed for all commercials in 

10 the system. Since insertion controller 46 usually provides 

all commercials with ID data uniquely associated with that 
commercial, logic 42 can readily organize the table using 
that ID data. In the preferred embodiment, the table is 
duplicated for each channel on which the commercial airs. 

15 This will accommodate any differences in the gain of 

comparison amplifier 32 or other errors from channel to 
channel . 



Many other modifications and variations are 
20 possible in the light of the above teachings. For example, 

in order to allow the above-described level setting 
activities to proceed without undue computational stress 
being required of logic 42, level detector 40 is preferably 
configured to output the measured level differences in 
25 decibels (dB) , a well-known operation which converts ratio 

(division) operations into addition. Those skilled in these 
arts can readily design the output voltage of level 
detector 40 to be proportional to the dB level of the audio 
differences. FIG. 13, to be described below in detail, 
30 includes a specific implementation of level detector 40 with 



a dB output. Attenuators 55-57, therefore, attenuate the 
audio signals in a conventional dB relationship with respect 
to errors En. Although the conversion to dB representation 
may be performed intrinsically in level detector 40, as 
5 depicted herein, it is possible and often preferable to take 

the functions of audio reference level circuit 41 and 
comparison amplifier 32 into the digital domain by 
digitizing the output of level detector 40. 

10 A source of error in the audio level output to the 

subscribers occurs when the input sources at receiver 21 
change from time to time. Such will be the case, for 
example, when a channel is shared during different times. 
During the day the channel might be used for educational 

15 programming and during the evening the channel might be used 

for pay programming. This necessitates the switching of 
inputs to the receiver channel. If the audio levels from 
the two different sources are different, which is the rule 
rather than the exception, CTV system 20 1 , an alternate 

20 embodiment, normalizes these differences. 

FIG. 6 illustrates CTV system 20 1 which monitors 
the level of a program segment, e.g. program segment PI, and 
adjusts its level to match the nominal commercial reference 

25 level LI. Additionally, the commercials have their levels 

corrected, if necessary, as they begin to play. The 
advantage of this embodiment is that, if a program level 
changes as a result of uplink errors or system gain changes, 
the level correction subsystem automatically corrects the 

30 match for these errors and/or changes. 

-24- 





In addition to the elements that make up CTV 
system 20, CTV system 20' has three attenuators 65-67 in the 
program audio path. More specifically, CTV system 20 1 shows 
attenuators 65, 66 and 67 connected in the respective left, 
5 right and SAP channels of program audio source 21A. 

Attenuator control lines 38 connect logic 42 to the control 
terminals of program attenuators 65, 66 and 67. By adding 
program attenuators 65-67, CTV system 20 1 functions to 
normalize errors in the program audio levels. As such, CTV 
10 system 20 1 automatically normalizes unwanted variations in 

program levels that may be caused by, for example, 
inconsistent adjustments made by those who operate CTV 
head-ends, or audio gain variations in the head-end that 
change with time or temperature. 

15 

The audio level normalizing technique of CTV 
system 20 1 parallels that of CTV system 20, except that 
system 20 1 contains enough controls to allow the audio of 
program segments PI and P2, and the audio of commercials CI 

20 and C2 to be set to reference level LI. This process will 

now be described with reference to FIG. 6, the curves in 
FIG. 7 and the FIG. 8 flow chart. At the start, monitor 
STEP 90 initiates the monitoring of the difference values 
outputted by A/D converter 44. In define STEP 91, logic 42 

25 sets the stored value of the target level to be equal to 

reference level LI, which FIG. 7 shows to be within the 
range of commercial CI. Upon receiving pre-roll trigger, in 
trigger STEP 81, logic 42 performs store STEP 82, trigger 
STEP 83 and calculate STEP 84 to find the average 

30 difference D at the output of A/D converter 44 over time 





period t2-t3 (or other time period as described above) . 
Logic 42 uses this average difference D, which represents 
the difference between the target level (equal to reference 
level LI) and the audio level of the last portion of program 
5 segment PI, as an audio level error signal for adjusting 

program attenuators 65 and 66. FIG. 7 shows the original 
position of program segment P2 1 being shifted to the new 
position of program segment P2 as a result of these 
adjustments . 

10 

After adjusting program attenuators 65 and 66, in 
adjust STEP 87, logic 42 exits the NO path of decision 
STEP 88 and proceeds to set STEP 89. At this point, 
logic 42 stores in memory 43 the output of A/D converter 44 
15 as audio level error En, which at this point in the present 

example equals error El. Logic 42 adjusts, in adjust 
STEP 92, commercial attenuators 55 and 56 using audio level 
error El. Because it was assumed that commercial CI was 
recorded substantially at the nominal advertisement 

2 0 recording reference level LI, audio level error El equals 

zero, making attenuator adjustments at this point 
unnecessary. In the absence of receiving a return trigger, 
in trigger STEP 93, logic 42 follows the NO path to trigger 
STEP 95 via index STEP 94 where index n is incremented. 
25 Upon receiving the next -commercial trigger from 

controller 46, in trigger STEP 95, logic 42 stores in 
memory 43 audio level error E(n+1) as being equal to the 
current output of A/D converter 44. At this point in the 
present example, logic 42 stores audio level error E2 as the 

3 0 difference between the audio level at the start of 




commercial C2 and the target level, i.e., reference 
level LI. As such, adjust STEP 92 performs an adjustment to 
advertisement attenuators 55 and 56 so that commercial C2 
will play at or near reference level LI. This process is 
5 repeated for succeeding commercials. 

When logic 42 receives a return trigger from 
controller 46, at time t4 in decision STEP 93, it returns 
the process to trigger STEP 81 to await reception of the 

10 next pre-roll trigger at, for example, time t5 . This 

sequence takes place via the YES path of decision STEP 93. 
When controller 46 transmits the next pre-roll trigger at 
time t5, logic 42 proceeds through STEPS 81-84, thereby 
obtaining a new average difference D for the final portion 

15 of program segment P2 over time period t5-t6. FIG. 7 shows 

the new average difference D, over period t5-t6, to be 
significantly smaller than the difference D' that would have 
occurred absent the prior adjustments made to program 
attenuators 65 and 66 in adjust STEP 87 at time t3 . 

20 

The level adjusting procedures so far described 
measure the average level of the program segment over a 
short time period at the end of the segment, e.g., between 
the pre-roll and transfer cues. In the FIG. 7 scenario, 

25 this procedure results in program segment PI running at a 

level much lower than target level LI. It is also noted 
that in the embodiments so far described, logic 42 adjusts 
the attenuators, e.g., attenuators 55-57 and/or 65-67, 
within a relatively short time period that would normally go 

30 undetected by the viewer. For example, after the average 




level of program segment Pi in FIG. 7 is detected over time 
period t2-t3, the level of commercial CI is adjusted (if 
necessary) within a short period that should go undetected 
by the listening or viewing audience. However, these 
functions can be modified in some situations by measuring 
the average program levels over one or more other time 
periods as the program segment runs while making small 
periodic attenuator adjustments to gradually bring the 
program or commercial level into line with the target level. 
Using this modified procedure with the FIG. 7 scenario, the 
first program segment PI can be gradually brought into line 
with target level LI well before receiving the pre-roll cue 
at time t3 . The application of this steady level adjustment 
process will be described below in greater detail with 
respect to the embodiment of FIGs . 9 and 10. 



invention apply equally well to other types of signals, 
including video, data, control, etc. Video levels are 
usually easier to control than are audio levels because a 
well defined relationship exists between the desired picture 
information and its sync signal level. Because the sync 
signal level is normally consistent from one program segment 
to all others, it may be easily used to set video level. 
However, a few cases exist in which it is desirable to 
automatically set the level of the video signal other than 
using the sync signals. One important instance occurs when 
different video sources are being switched to a common 
output channel, or when video system gain changes over time. 



It has been noted above that the teachings of this 




Therefore, if level control attenuators are placed 
in the output of an advertisement video source, such as in 
lines 2 7 of FIGs . 3 and 6, they can be used to match the 
video level of a commercial to that of the program video 
5 source. A detector would detect the level of the video by 

measuring sync tip amplitude. It is known to those skilled 
in the art how to measure sync amplitude. In addition, if a 
second set of level control attenuators are placed in the 
program video source output, such as in lines 22 of FIGs. 3 
10 and 6, they could be used to correct for incorrect video 

level coming from that program video source. 

FIGs. 9 and 10 show yet another embodiment. Here 
CTV system 20 11 corrects audio-level differences that may be 

15 introduced by audio- level adjustments contained in 

conventional modulator equipment. A control device on a 
conventional modulator, e.g., modulator 30, is frequently 
labeled "DEVIATION, " because it controls the deviation of 
the sound carrier: the greater the audio level the greater 

20 the deviation. If the operators set the modulator 

deviations inconsistently from one channel to the next, the 
associated audio levels will change when subscribers are 
tuning channels. Thus, controlling the deviation (sound 
level) between channels of a multi-channel CTV system 20'', 

25 as depicted in FIGs. 9 and 10, creates further normalization 

problems . 

The normalization technique of CTV system 20 ,! 
parallels that of CTV system 20 1 of FIG. 6, with a notable 
30 exception that level detector 40 derives its audio input 
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signals form different points. CTV system 20 1 derives its 
audio level information from the individual subscriber 
channels, viz., lines 33. On the other hand, CTV 
system 20 ,? derives its audio level information from a point 
common to all subscriber channels. More specifically, while 
CTV system 20 1 uses the left and right audio channels 
outputted by receiver 21 and ad- insertion system 24 via 
adder 39, system 20' ' derives the audio from the main 
subscriber lines 101. 

Level processor circuit 26 1 of system 20' f , 
employs tuneable demodulator 60 to selectably monitor the 
audio outputs of the various subscriber channels. The audio 
input of demodulator 60 connects to the main subscriber 
lines 101 via directional coupler 61. When monitoring a 
particular channel, demodulator 60 tunes to the output of 
the appropriate channel modulator, e.g., modulator 30A or an 
equivalent audio modulator (not shown) in one of the other 
channels 100. A preferable implementation of demodulator 60 
includes a conventional agile television demodulator with a 
calibrated audio output and a tuner suitable for control by 
controller 46. In the television system used in North 
America (NTSC) , the audio on main subscriber lines 101 
deviates the audio carrier by 25 kilohertz (KHz) . In this 
regard, demodulator 60 is preferably calibrated such that 
its output voltage equals a known value for this 25 KHz 
deviation. As such, this known value corresponds to audio 
reference level LI. Because demodulator 60 produces a 
monaural sum signal, level processor circuit 26' does not 
include an adder similar to adder 39 of systems 20 and 20 1 . 
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For CTV system 20 1 the cue tones received by 
commercial insertion controller 46 transmit on line 23 from 
program video source 21V and on lines 23 1 from similar 
equipment (not shown) in the other channels 100. In 
5 addition to timing information, these cue tones include 
channel tuning information which insertion controller 46 
uses to direct the ad-insertion process of ad-insertion 
system 24 1 . In this regard, ad-insertion system 24 1 is a 
conventional system having multiple channels for 

10 simultaneously inserting commercials in more than one 

subscriber channel via lines 27 and 27'. Additionally, 
insertion controller 46 uses the channel tuning information 
to automatically tune audio demodulator 60 from one 
subscriber channel to another. When a program segment, 

15 e.g., program segment PI of FIG. 2, nears its end on any 

channel, commercial insertion controller 46 tunes audio 
demodulator 60 to that channel for performance of its 
appropriate normalization process. 

20 The audio level normalization control loop for 

each channel in CTV system 20' ' includes attenuators 55-57 
and 65-67, and stereo encoder and audio modulator 3 OA. 
These items will be unique to each channel. This 
normalization control loop also includes level processor 

25 circuit 26 1 which comprises demodulator 60, level 

detector 40, amplifier 32, A/D converter 44 and logic 42. 
These items are shared by all of the channels. Level 
processor circuit 26 1 normalizes the program audio from all 
channels, including other channels 100, and commercial audio 

30 being routed through all channels from ad- insertion 




system 24'. As such, CTV system 20' 1 offers a closed-loop 
control of audio level that includes normalization of 
mismatched levels due to several sources including the audio 
modulators . 

5 

Having the elements of tuner processor circuit 26' 
common to all channels both minimizes cost and promotes more 
consistency from one channel to another. The use of common 
equipment does, however, preclude the measurement of the 

10 audio level just before a commercial on a plurality of 

channels. Consequently, a queuing routine must be used for 
defining priority channels as the insertion process 
proceeds. The present invention contemplates that insertion 
controller 46 be programmed to define and select priority 

15 channels to which audio demodulator 60 is tuned just prior 

to a commercial. As mentioned above, conventional insertion 
controllers know the time that commercials will play within 
a minute or two in most cases. As such, some channels can 
be monitored between commercials and their audio deviation 

20 optimized for average audio over a period other than the 

short period just before the commercial (e.g., other than 
time period t2-t3) . In practice, this solution will produce 
sufficient normalization in most cases because audio levels 
do not change particularly fast. 

25 

Further, when commercials begin simultaneously on 
several subscriber channels (a common occurrence) , insertion 
controller 46 selects these channels sequentially according 
to pre-defined priority routines. Although insertion 
30 controller 46 may assign tuning priorities for audio 




demodulator 60 according to any number of queuing routines, 
including a random assignment routine, a preferred priority 
routine for making priority assignments is as follows: the 
channels to be given the highest priority are chosen from a 
random selection of the channels whose level offsets are 
unknown (i.e., they have never been measured and stored); 
followed by an ordered selection of the remaining channels 
based on the length of time that its commercial has last 
played, with those having the longer periods given the 
higher priority. Using this priority routine, commercials 
that have not yet played and, therefore, have not yet been 
normalized are given top priority. Because there is more 
opportunity for equipment level settings to drift over time, 
priorities are next assigned based on the length of time 
since the commercial last played; the one with the longest 
time period of non-play assigned the top priority. When 
there is a conflict in priorities, the routine makes random 
assignments from those involved in the conflict. 

Quality control functions are possible for systems 
of the type shown in FIGs. 9 and 10. It is known by viewers 
of cable television that the audio level from one channel to 
another is not always consistent. This is usually caused by 
head-end set-up errors. The set-up errors are, in turn, 
caused by excessive work load or insufficient training of 
personnel, or by limitations in equipment. By sequentially 
monitoring the level on all channels, the audio level of all 
channels can be made the same, solving a long-standing 
complaint against conventional cable systems. 
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Another quality control function is contemplated 
using the standard pilot level on stereo channels. It is 
generally known that the pilot level on stereo channels is 
not automatically set correctly. After a stereo encoder 
5 generates such pilot levels, they are modulated onto the 
audio carrier. In some equipment, the adjustment of the 
pilot level is a factory calibration, and in other equipment 
is a field adjustment. It is possible to add to level 
processor 26', for example, a channel tuned to the 
10 standard 15.734 KHz pilot frequency. The level of the pilot 

could then be measured. Due to the way equipment is 
normally built, it will not be readily possible to 
automatically correct the pilot level, but a visual or 
audible indicator could notify an operator of a problem. 

15 

As suggested above, those skilled in the art are 
aware that measuring audio level is extremely difficult, in 
part by virtue of the fact that normal programming includes 
constantly varying audio levels, possibly covering a very 

20 large dynamic range. For example, a program may include a 

loud argument between two individuals, followed by an almost 
silent scene when one of them runs out of the house and 
through woods. It is desirable in a case such as this, to 
set the deviation during the argument, letting the audio 

25 level during the running scene fall where the director 1 s 

artistic judgment placed it. Adjusting to a louder signal 
(higher deviation) during the running scene, is not 
appropriate. An algorithm for gradually adjusting audio 
level (and for determining the match level of a commercial) 

3 0 is programmed into the software of logic 42 and 





controller 46. Although various algorithms are possible, 
one that experience teaches may be most appropriate with 
respect to system 20 11 of FIGs. 9 and 10 will now be 
described. 

5 

During program content, loudness is periodically 
monitored over several periods and attenuators 65 and 66 are 
adjusted while a program segment runs. A one minute (for 
example) timer in logic 42 starts each time level 

10 detector 40 indicates that the audio level has reached 

reference level LI. If the audio level reaches reference 
level LI more than, for example, five times during a minute, 
the level is turned down by increasing the attenuation of 
attenuators 65 and 66. The adjustment is done in a minimum 

15 control increment so as not to be audible to the listener. 

For example, an increment of 0.5 dB is suitable. On the 
other hand, if the audio does not reach the maximum level at 
all during the minute, the volume is increased by adjusting 
attenuators 65 and 66. The volume is increased by the same 

20 minimum amount, which is not detectable by the listener. If 

another minute elapses without the audio level reaching the 
reference level LI, logic 42 adjusts attenuators 65 and 66 
so that the audio level is again increased by that minimum 
amount. Such an algorithm will result in the volume level 

25 ultimately being set to reference level LI as close as 
possible . 

Another algorithm would monitor the level with 
respect to reference level LI, and any time the audio level 
30 reaches reference level LI, logic 42 adjusts attenuators 65 





and 66 so that the audio level is reduced almost 
instantaneously until it drops below reference level LI. If 
reference level LI is not reached again for some length of 
time, such as one minute, logic 42 adjusts attenuators 65 
5 and 66 so that the audio level increases by the same small 
increment as above. The increase is repeated at intervals 
of a minute or so, until reference level LI is again 
reached. This type of algorithm forms a fast attack, slow 
decay volume control strategy, which is often preferred for 
10 volume control downstream of where the artistic qualities of 
the program are set. In general this is the type of 
algorithm that is favored in this circumstance. 

In certain circumstances the artistic qualities 
15 may be changed by bringing the volume up more rapidly if it 

does not reach reference level LI. This changes the control 
strategy to what is often called "compression." It is 
generally not preferred unless typical listening conditions 
are different from those anticipated during creation of the 
20 program. 

Each audio transmission, be it part of a 
commercial or an entertainment program, has a characteristic 
audio- level signature that may be derived from level 

25 detector 40. FIGs. 11 and 12 illustrate a verification 

system that uses such audio- level signatures to 
automatically determine if the content of the transmission 
corresponds to the content that was intended. This function 
is of particular importance in advertising where it is 

30 desirable to verify that a particular commercial was 





transmitted at the proper time on the proper channel . 

Referring now to FIGs . 11 and 12, verification 
system 109 includes digital sampler 110 which outputs a 
5 unique series of audio level samples over predefined time 

periods for each commercial. Digital processor 111 receives 
these samples from sampler 110 while receiving commercial ID 
data and trigger pulses from insertion controller 46 in 
monitor STEP 120 and data STEPS 121. Processor 111 also 

10 connects to memory 112, correlation detector 113 and 

alarm 114. Generally, processor 111 stores the signatures 
for all commercials that are to be played in a signature 
table in memory 112, via the YES path of decision STEP 122 
and store STEP 123. These signatures are stored, along with 

15 their commercial ID data, the first time that each 

commercial is played by the CTV system, or, alternatively, 
they may be imported to memory 112 from data obtained when 
the commercials are recorded. Processor 111 performs 
corresponding autocorrelations on each of the signatures and 

20 stores the autocorrelation results in the signature table. 

Still further, for each commercial, processor 111 finds the 
standard deviation for the set of samples that make up its 
signature. These standard deviation values, which are 
measures of the extent that the set of samples of each 

25 signature deviates from its mean, are also calculated and 

stored in the signature table during store STEP 123. 

Later, when a particular commercial plays, 
processor 111 stores, in store STEP 125, its commercial ID 
3 0 data and signature in a verification table to be used for 




verification. To verify that a particular commercial that 
was played, say commercial CX, was the desired commercial, 
say commercial CI, processor 111 retrieves the CX signature 
of the unknown commercial from the verification table. 
Processor 111 performs, in calculate STEP 126, a correlation 
of the unknown signature CX with the known signature CI. 
Processor 111 then compares, in decision STEP 128, the 
autocorrelation Cl/Cl with the correlation Cl/CX and the 
standard deviation of commercial CI. If the 
autocorrelation Cl/Cl and correlation Cl/CX are of 
comparable value and high as compared to the standard 
deviation of commercial CI, then there is a strong 
likelihood that commercial CX corresponds to commercial CI. 
However, if the autocorrelation Cl/Cl is high compared with 
the standard deviation of commercial CI while 
correlation Cl/CX is low compared with the standard 
deviation of commercial CI, processor 111 fails to verify 
commercial CX and activates alarm 114. Of course, the 
autocorrelation results of the stored signatures need not be 
stored in signature table, but instead calculation STEP 12 6 
may include the process of obtaining both the desired 
autocorrelation Cl/Cl and the apparent 

cross-correlation Cl/CX before obtaining verification via 
verify STEP 128. 

To maximize system performance, it is contemplated 
further that level detector 40 preferably measure the 
subjective loudness of the audio. This contrasts with other 
measuring options, such as measuring peak audio levels. In 
the instant invention level detector 40 preferably obtains a 





metric of the audio level that correlates well with the way- 
humans subjectively perceive the loudness of audio program 
material . 



signal is a difficult undertaking. One of the earlier known 
standards for doing this is the volume unit (VU) meter 
developed cooperatively by the telephone and broadcast 
industries in 193 9, and still in use today. The VU meter 
standard was developed based on the capabilities of the 
D f Arsonval meter movements available at that time. That the 
standard remains in use today is more of a testament to the 
difficulty of measuring subjective loudness, than it is to 
the validity of the standard as a measure of loudness. 
Those skilled in these arts normally use this standard only 
for general guidance while imposing their own subjective 
judgment as to how loud a signal sounds. 



years used a peak reading meter to measure the loudness of a 
signal. Some authorities propose that the EBU method is 
superior to using a VU meter because it relates nicely to 
the needs of engineers monitoring the peak level of a signal 
for transmission purposes. However, researchers have sought 
better methods for measuring an audio signal that correlate 
with subjective loudness. FIG. 13 shows a schematic of a 
preferred circuit suitable for use as level detector 40 for 
measuring subjective loudness of an audio signal. 



Measuring the subjective loudness of an audio 



The European Broadcast Union (EBU) has for some 




Level detector 40 is a particularly critical 



circuit, in that it must produce a voltage that varies 
proportionally to the subjective loudness heard by a typical 
listener. Level detector 40 , as depicted in FIG. 13, 



curve, such as shown in FIG. 14, with a factor reflecting 
the observation that fast, transitory sound is not perceived 
as loudly as would be the same sound if it lasted longer. 
These concepts are discussed in the following publications: 



and Burden, et al . , "A Different Approach to the Old Problem 
of Audio Level Monitoring, " 84th Convention of the Audio 
Engineering Society, March 1-4, Paris, 1988, pp 1 to 8 . 

5 The A-weighting curve plots response vs . frequency 

resemble 

such that its values closely rcocmble -s the typical response 

A 

of the human ear. Filtering circuits having a response that 
approximates the A-weighting curve have been used in audio 
level measurements as a means of pre- filtering a signal such 

0 that subsequent detection of volume level will be roughly 

equal to the frequency response exhibited by the human ear. 
Further, it is known that humans generally perceive loudness 
partially as a function of duration of the sound, with 
transitory signals sounding progressively louder until the 

5 duration exceeds about 200 micro- seconds . Beyond this 

period, the perceived loudness no longer varies as a 
function of duration. 



5 



combines a close approximation of a standard A-weighting 



0 



Benson, Audio Engineering Ha n db oo k , 1988, pp 1-38 to i-39 ; 



0 



FIG. 13 depicts the specific circuit elements for 
an implementation of level detector 40 that was tested and 




compared to an A-weighting curve as shown in FIG. 14. The 
circuit schematic of FIG. 13 places parameter values for the 
various elements adjacent the element symbols. In FIG. 13, 
capacitance values are given in microfarads and resistance 
5 values are in ohms. The reference markings adjacent 

diodes D1-D4 and amplifiers U1A, U1B, U1C, U1D and U2A are 
conventional identifications of these elements. 
Transistor Ql is a conventional NPN transistor. 

10 The audio signal monitored by level detector 40 

appears at input terminal 140. Amplifier U1A and the 
associated elements constitute an active band-pass 
filter 141. Generally, filter 141 is a conventional circuit 
with the addition of capacitor C3 which creates a portion of 

15 the rolloff required by the A-weighting curve. From 

filter 141, the signal passes to low-pass filter 142 
comprising amplifier U1B and the associated components. 
Again, this circuit is known to those skilled in the art. 
High-pass filter 143, comprising capacitor C5 and 

20 resistor R8, provides a portion of the rolloff at low 

frequencies required by the response of the A-weighting 
curve . 

The output of high-pass filter 143 passes to full 
25 wave rectifier 144, consisting of amplifiers U1C and U1D and 

their associated components. Again, this circuit is 
familiar to those skilled in the art. The output at pin 8 
of U1C is a full wave rectified representation of the audio 
signal at input terminal 140 after it has passed through 
30 filters 141-143. The R-C circuit, formed by resistor R16 



and capacitor C6, limits the attack time of the circuit to 
limit the effect on the audio level of a momentary high 
level sound. The time constant of the R-C circuit is much 
shorter than the maximum duration proposed by the Benson 
publication cited above. This is done to allow a certain 
degree of fast attack characteristic to pass to the 
measuring facility. 

Finally, dB circuit 145, comprising amplifier U2A 
and associated components, converts the output of 
rectifier 144 to decibel or logarithmic representation. 
This circuit is also known to those skilled in the art. 
Output terminal 146 passes the signal to comparison 
amplifier 32. FIG. 14 shows the A-weighting curve as a 
solid line and the test values of the filter response of 
level detector 40 as X's. As can be seen from inspection, 
there is a close correlation between the responses of the 
A-weighting curve and the test results of the FIG. 13 
implementation of level detector 40. 

It is to be understood, therefore, that within the 
scope of the appended claims, the invention may be practiced 
otherwise than as specifically described. 




